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Abstract 

Background: To compare the distribution of the intrinsic molecular subtypes of breast cancer based on 
immunohistochemical profile in the five major geographic regions of Brazil, a country of continental 
dimension, with a wide racial variation of people. 

Methods: The study was retrospective observational. We classified 5,687 invasive breast cancers by molecular 
subtype based on immunohistochemical expression of estrogen-receptor (ER), progesterone-receptor (PR), human 
epidermal growth factor receptor 2 (HER2), and Ki-67 proliferation index. Cases were classified as luminal A (ER and/or 
PR positive and HER2 negative, Ki-67 < 14%), luminal B (ER and/or PR positive, HER2 negative, and Ki-67 > 14%), 
triple-positive (ER and/or PR positive and HER2 positive), HER2-enriched (ER and PR negative, and HER2- positive), 
and triple-negative (TN) (ER negative, PR negative, and HER2- negative). Comparisons of the ages of patients and 
molecular subtypes between different geographic regions were performed. 

Results: South and Southeast regions with a higher percentage of European ancestry and higher socioeconomic 
status presented with the highest proportion of luminal tumors. The North region presented with more aggressive 
subtypes (HER2-enriched and triple-negative), while the Central-West region predominated triple-positive carcinomas. 
The Northeast — a region with a high African influence — presented intermediate frequency of the different molecular 
subtypes. The differences persisted in subgroups of patients under and over 50 years. 

Conclusions: The geographic regions differ according to the distribution of molecular subtypes of breast cancer. 
However, other differences, beside those related to African ancestry, such as socioeconomic, climatic, nutritional, 
and geographic, have to be considered to explain our results. The knowledge of the differences in breast cancer 
characteristics among the geographic regions may help to organize healthcare programs in large countries like 
Brazil with diverse economic and race composition among different geographic regions. 
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Background 

Breast cancer remains a major health problem responsible 
for 458,400 deaths worldwide in 2008 [1]. The molecular 
intrinsic subtypes discovered in 1999 provided additional 
information on the clinical outcome independent of con- 
ventional prognosticators such as tumor size, tumor grade 
and lymph node status [2]. More importantly, various 
molecular subtypes respond differently to various chemo- 
therapy treatments, so an accurate subclassification is im- 
portant for deciding treatment options [3-6]. 
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Racial/ethnic differences have been observed among 
the different molecular subtypes [7-12] with the most 
documented being the relatively high incidence of basal 
type breast cancer (also called "triple negative") among 
African-American women compared to Caucasian fe- 
males [7,8,12,13]. Although non-white women, particu- 
larly of African descent, have a lower incidence of breast 
cancer, this particular race group present with more ag- 
gressive tumors and a higher mortality rate [13,14]. 

We investigated whether a similar effect of race could 
be present in Brazil, a country of a continental size with 
a wide variation in the distribution of people from vari- 
ous racial backgrounds in its five major geographic re- 
gions. Brazil is a large country with the fifth largest area 
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and the fifth largest population in the world with a total 
of more than 190 million inhabitants according to the 
2010 official census [15]. It has 26 states divided over five 
major geographic regions: North, Northeast, Central-West, 
Southeast and South. The social-economic situation and 
ethnic backgrounds vary greatly among these regions 
and include descendants from Amerindians, European 
immigrants, Asians, and Africans brought to the con- 
tinent as slaves in the 18"^ century. Although there is a 
tendency of one race predominant over another in 
these regions (for example, Amerindians in the North, 
blacks in the Northeast and Europeans in the South 
and Southeast), there are some ethnic variations in the 
Brazilian population mainly due to a high rate of inter- 
racial marriage. Anti-miscegenation and segregation laws 
have not been part of the Brazilian culture, so Brazil is a 
home to a population characterized by a color continuum 
between white and black races. This leads to a well-known 
difficulty in categorizing races because they lack a precise 
legal definition [16]. 

Previous studies actually demonstrated that in Brazil 
color, as determined by a physical evaluation, is a poor 
predictor of genetic ancestry estimated by molecular 
markers [17,18]. There remain, however, significant 
geographical differences in the racial makeup of the 
population, influenced by diverse geographic and economic 
characteristics of this country with the size comparable to 
a continent. The knowledge of possible differences in a 
large and ethnically complex country such as Brazil clearly 
would benefit not only the study of breast cancer in this 
country, but also the comprehension of the mechanisms 
involved in different molecular subtypes, not to mention 
the opportunity to develop more efficient strategies of 
prevention and early detection of breast cancer, particu- 
larly among the minorities. 

In Brazil, federal law regulates mammographic screening 
and it has been offered to all women over 50 years of age 
since 2009. The number of new cases of breast cancer in 
Brazil estimated in 2014 was 57,120 [19]. Currently, there is 
no data regarding the distribution of breast cancer molecu- 
lar subtypes in BrazO or even within individual states of this 
country. Our aim is to compare the distribution of the in- 
trinsic molecular subtypes by the immunohistochemical 
surrogates in the five major geographic regions in Brazil. 

Methods 

Patient selection 

All patient data were obtained from the files of Consultoria 
em Patologia, a private surgical pathology laboratory lo- 
cated in the city of Botucatu, State of Sao Paulo, Brazil. 
Consultoria em Patologia is a reference laboratory and ana- 
lyzes over 5,000 breast cancer cases per year. These cases 
usually are sent by pathologists and oncologists from all 
five geographic regions of Brazil in order to be evaluated 



for predictive and prognostic immunohistochemical 
markers, i.e., estrogen and progesterone receptors (ER 
and PR), human epidermal growth factor receptor 2 
(HER2) and Ki-67 proliferation index. Age at diagnosis 
and the state of origin of the patients were obtained 
from the pathological report. Geographic regions of 
Brazil were classified as North, Northeast, Central-West, 
South, and Southeast. We selected consecutive cases of 
invasive breast cancer from July 2009 to March 2011 
with assessable immunohistochemical study of ER, PR, 
HER2 and Ki-67. One of the reasons this period of time 
was chosen is that all these cases had the immunohisto- 
chemistry study performed using exactly the same 
immunohistochemical protocol. Another reason was 
that we included only the cases sent for routine prog- 
nostic and predictive factors determination. We excluded 
in situ and microinvasive carcinomas as well the cases, 
which were sent for second opinion. 

Immunohistochemistry analysis 

ER, PR, and HER2 status and Ki-67 proliferation index 
were determined at the time of the patient's cancer 
assessment by an immunohistochemical method on a 
selected tumor block. ER and PR were considered 
positive with >1% of the nuclear staining in tumor 
cells, although in all cases except in three, ER and PR 
results showed > 10% of positive cells. HER2 was con- 
sidered positive with a 3+ score and negative with a 0+ or 
1+ immuno reactivity using the previous ASCO/CAP rec- 
ommendation [20]. Also, based on ASCO/CAP recom- 
mendations, breast cancers with 2+ immunohistochemical 
scores were evaluated for fluorescence in situ hybridization 
(FISH), and a ratio of >2.2 of HER2 gene to chromosome 
17 was considered positive for HER2 gene overexpression; 
ratio <1.8 was considered negative for HER2 amplification, 
and cases with a ratio between 1.8 and 2.2 were classified 
as equivocal for HER2 amplification [20]. The Ki-67 
proliferation index was determined in the area with the 
highest Ki-67 nuclear labeling. A total of 300 proliferating 
and nonproliferating cells were counted, and the per- 
centage of proliferating cells was calculated. Table 1 
summarizes the specifications of the primary antibodies 
used. 

Molecular subtype classification 

Immunohistochemical surrogate markers (ER, PR, HER2, 
Ki-67, EGFR and CK5/6) were used to determine the mo- 
lecular category of breast cancer as previously described 
in the following groups: luminal A (ER + and/or PR+, 
HER2- and Ki-67 < 14%), luminal B (ER + and/or PR+, 
HER2- and Ki-67 > 14%), triple-positive (TP) (ER + and/or 
PR+, HER2+), HER2-enriched (ER- and PR- and HER2+), 
and triple-negative (TN) (ER-, PR- and HER2-) [21,22]. 
HER2+ included all the cases with a score of 3+ by 
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Table 1 Source and dilutions of the antibodies and epitope retrieval methods used in this study 



Antibody to 


Clone 




Source 


Dilution 


Epitope retrieval 


Estrogen receptor 


Rabbit monoclonal 


antibody, SPl 


Tiiermo Scientific 


1:500 


Pressure cooker, 9 min 


Progesterone receptor 


Mouse monoclonal 


antibody, PgR636 


Dako 


1:1000 


Pressure cooker, 9 min 


HER2 


Rabbit monoclonal 


antibody, SP3 


Tiiermo Scientific 


1:100 


Microwave oven 


Ki-67 


Mouse monoclonal 


antibody, IVilBl 


Dako 


1:600 


Pressure cooker, 9 min 



HER2: human epidermal growth factor receptor 2; Pressure cooker: citrate buffer (pH 6) (Tender Cooker, Nordic Wave, USA). Microwave: citrate buffer (pH 6), 
15 min (Electrolux*, 900 W). 



immunohistochemistry and cases with score 2+ but 
showing amplification demonstrated by FISH according 
to the guideUnes of The American Society of Chnical 
Oncology (ASCO) and the College of American Pathologists 
(CAP) [20]. 

Statistical analysis 

Descriptive statistical analysis was used to characterize 
the distribution of the patient's age at diagnosis, hormo- 
nal receptor status, HER2 status, and molecular subtypes 
for the total sample and by the geographic region. Com- 
parisons of the age of patients between different geo- 
graphic regions, molecular subtypes, ER/PR status, and 
HER2 status were performed using the Kruskal-Wallis 
test. Associations between molecular subtypes, hormonal 
receptor status, HER2 expression, and categories of 
patient's age with geographic regions were tested by a 
chi-square test. Missing values were not included in 
our statistical analysis and were deleted list-wise. Statis- 
tical analyses were performed using MedCalc for Win- 
dows (version 11.5.0.0; MedCalc Software, Mariakerke, 
Belgium), and a p-vahie less than 0.05 was considered 
significant. 



Institutional approval 

The study was approved by the Scientific Committee of 
the Department of Pathology of the Faculdade de Medicina 
da Universidade de Sao Paulo, and also by the Ethical 
Committee for Research Projects of the Hospital das 
Clinicas da Faculdade de Medicina da Universidade de 
Sao Paulo (CAPPesq) (protocol 311/10). 

STROBE statement 

This study has adhered to the STROBE guidelines for 
observational studies. 

Results 

In total, 5,687 eligible cases were included in the final 
analysis. The age of patients ranged from 16 to 98 years 
(mean 55.5 ± 13.5 years, median = 54 years). The distri- 
bution of age at diagnosis, hormonal and HER2 status, 
and the molecular subtypes are summarized in Table 2. 
In regional distribution, the age at diagnosis had the 
lowest mean in the North and Central- West regions and 
among patients with negative ER/PR, and HER2- posi- 
tive tumors (Tables 2 and 3). Among molecular subtypes, 
however, the lowest mean age was seen among patients 



Table 2 Comparison of age of patients at breast cancer diagnosis, molecular subtype, expression of estrogen and 
progesterone receptor, and HER2 status among the five Brazilian geographic regions 



Variable 



Brazilian geographic regions 



p-value 







SE 


S 


NE 


CW 


N 


Age 


(mean±SD) 


56.1 ±13.5 


55.8 ± 13.9 


55.3 ± 13.7 


543 ± 1 2.8 


53.9 ± 13.1 


IVIolecular subtype 


Luminal A 


707 (28.8%) 


324 (30.8%) 


226 (24.1%) 


143 (25.9%) 


171 (25.3%) 




Luminal B 


971 (39.5%) 


388 (36.9%) 


348 (37.1%) 


1 90 (34.6%) 


208 (30.8%) 




Triple-positive 


238 (9.7%) 


114 (10.8%) 


100 (10.7%) 


71 (12.9%) 


68 (10.1%) 




HER2 


1 95 (7.9%) 


71 (6.7%) 


99 (10.5%) 


49 (8.9%) 


91 (13.5%) 




Triple-negative 


345 (14.0%) 


154 (14.6%) 


163 (17.4%) 


96 (17.4%) 


1 37 (20.3%) 




Non-available^ 


4 


5 


3 


2 


3 


ER/PR 


Positive 


2039 (82.8%) 


871 (82.5%) 


753 (80.0%) 


418 (75.9%) 


529 (78.0%) 




Negative 


422 (17.1%) 


185 (17.5%) 


188 (20.0%) 


133 (24.1%) 


149 (22.0%) 


HER2 


Positive 


433 (17.6%) 


188 (17.8%) 


201 (21.3%) 


119 (21.6%) 


1 63 (24.0%) 




Negative 


1972 (80.1%) 


846 (80.1%) 


725 (77.0%) 


422 (76.6%) 


506 (74.6%) 




Score2+ 


56 (2.3%) 


22 (2.1%) 


15 (1.6%) 


11 (2.0%) 


9 (1 .3%) 


Total 




2461 


1056 


941 


551 


678 



<0.0001" 
<0.000l'' 



0.0003° 



0.017" 



''Kruskal-Wallis; ''chi-square; 'HER2 score 2+ and no FISH study to define the HER2 status. 
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Table 3 Comparison of age among different geographic regions, molecular subtypes, ER/PR status, and HER2 status 



Factor 




n 


Mean ± SD (years) 


Significant difference (<0.05) from other factor(s)° 


Geographic Region 


(1) CW 


551 


54.3 


(5) 




(2) N 


678 


53.9 


(3)(4)(5) 




(3) NE 


941 


55.3 


(2) 




(4)S 


1056 


55.8 


(2) 




(5) SE 


2461 


56.1 


(1)(2) 


IVIolecular subtype 


(1) HER2 


505 


54.2 ± 1 2.8 


(2)(5) 




(2) LUM A 


1571 


58.5 ±13.0 


(1)(3)(4)(5) 




(3) LUM B 


2104 


55.7 ± 13.7 


(2)(4)(5) 




(4)TN 


894 


53.1 ±134 


(2)(3)(5) 




(5) TP 


591 


51.6 ± 13.0 


(1)(2)(3)(4) 


ER/PR status 


(1) Positive 


4610 


55.6 ±13.5 


(2) 




(2) Negative 


1077 


534 ±134 


(1) 


HER2 status 


(1) Positive 


1104 


52.8 ±12.9 


(2) 




(2) Negative 


4470 


56.1 ±13.6 


(1) 


''Kruskal-Wallis test, p < 0.0001 for all variables. 



with a TP phenotype followed by TN carcinomas (Table 3). 
The distribution of the molecular subtypes differed in the 
five regions, which is illustrated in Figure 1. Luminal A 
carcinomas were more frequent in the Southeast (28.8%) 
and South (30.8%) regions. The highest proportion of 
luminal B carcinomas (39.5%) was seen in the Southeast 
region {p < 0.0001). Actually, the Southeast and South re- 
gions presented the highest proportion of ER/PR-positive 
tumors (p = 0.0003) (Table 2). HER2-enriched tumors 
were most frequent in the North region. The highest pro- 
portion of TP carcinomas came from the Central- West re- 
gion. Triple negative carcinomas were more prevalent in 
the North region {p < 0.0001) (Table 2). The differences of 
frequency of the molecular subtypes in the five geographic 
regions persisted when we analyzed the subgroups of 
patients under 51 years (p = 0.0001) and over 50 years 
{p = 0.0012) (Table 4). 

Table 5 summarizes the distribution of our cases, the 
official data of the Brazilian female population, and the 
incidence of breast cancer in the five geographical re- 
gions. Figure 1 illustrates the distribution of all the cases 
in this study according to the geographical regions. 



Discussion 

Brazil is a large country with a surface area of 8,51 1,960 km^ 
and with a population of 190,755,799. It is the most 
populous country in Latin America as well as one of 
the most populous in the world. The female popula- 
tion, according to the official 2010 census, is 97,348,809 
[15]. We found significant differences in the proportion of 
the molecular subtypes of breast cancer between the five 
Brazilian geographic regions — each one with a distinct 
racial composition together with differences in climate. 



nutritional habits, urbanization and socioeconomic 
conditions. 

The Brazilian racial composition is very heterogeneous 
with most populations descending from Europeans, 
Africans, Amerindians of various ethnic groups, and 
Asians. Most importantly, Brazil has experienced a 
high rate of intermarriage, and consequently, a mixed- 
race population has been created along five centuries 
of admixture. 

A racial/ethnic influence on the presentation of breast 
cancer has been suggested in various studies [7,14,23-26] 
although race definition was neither very clear nor uni- 
form in these studies. Most investigated race as a social 
characteristic without biologic basis of classification. 
Carey et al. conducted a population-based, case-controlled 
study with cases from the Carolina Breast Cancer Study 
and observed a higher prevalence of basal-like among 
premenopausal African Americans (39%) compared with 
postmenopausal African Americans (14%) and non- African 
American patients of any age (16%) [7]. Kurian et al. dem- 
onstrated different lifetime risks of the distinct subtypes 
defined by the ER, PR, and HER2 status among black, 
Hispanic, Asian, and white women [26] . The authors found 
a higher lifetime risk of TN tumors in black women and 
discussed the importance of this data on health policy and 
resource planning. 

The high prevalence of basal-like tumors in premeno- 
pausal African Americans is pointed to as an important 
factor to the poorer prognosis in this group of patients 
[27,28], justifying the higher mortality rate despite the 
lower incidence [13,14]. Harper et al. documented 31/ 
100,000 annual deaths among African Americans com- 
pared with 27/100,000 annual deaths for white women 
[29]. Delay in diagnosis due to disparities in healthcare 
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North Region 

(n = 678 cases, 

LumA=17l (25.3%) 
Lum B = 208 (30.8%) 
HER2 = 91 (13.5%) 
TP = 68(10.1%) 
TN = 1 37 (20.3%) 




Lum B= 190 (34. 
HER2 = 49 (8, 
TP = 71 (12.9%) 
TN = 96(17. 



Northeast Region 
(n = 941 cases) 

Lum A = 226 (24.1%) 
Lum 8 = 348 (37.1%) 
HER2 = 99(10.5%) 
TP =100 (10.7%) 
TN = 163 (17.4%) 



Southeast Region 



(n = 2461 cases) 

Lum A = 707 (28.8%) 
Lum B = 971 (39.5%) 
HER2 = 1 95 (7.9%) 
TP = 238 (9.7%) 
TN= 345 (14.0%) 



(n = 1056 cases) 



Legend 



Lum A = Luminal A 
Lum B = Luminal B 
HER2 = HER2 

TP = Triple-positive (ER/PR/HER2 positive) 
TN = Triple-negative (ER/PR/HER2 negative) 



Lum A = 324 (30.8%) 
Lum B = 388 (36.9%) 
HER2 = 71 (6.7%) 
TP = 114(10.8%) 
TN = 154 (14.6%) 



Figure 1 Brazilian map with the 5 geographic regions showing the distribution of 5,687 cases of breast cancer according to molecular 
subtypes determined by immunohistochemical surrogates. 



Table 4 Comparison of the molecular subtypes among the five Brazilian geographic regions according to age of 
patients at diagnosis 



Geographic Molecular subtypes 



regions 


Luminal A 




Luminal B 




Triple-positive 


HER2 




Triple-negative 


Age at 


<50y 


>50y 


<50y 


>50y 


<50y 


>50y 


<50y 


>50y 


<50y 


>50y 


diagnosis 






















Southeast 


188/810 


519/1651 


332/810 


639/1651 


94/810 


144/1651 


60/810 


135/1651 


135/810 


210/1651 




(23.2%) 


(31.4%) 


(41 .0%) 


(38.7%) 


(1 1 .6%) 


(8.7%) 


(74%) 


(8.2%) 


(167%)) 


(12.7%) 


South 


91/387 


233/669 


1 50/387 


238 /669 


57/387 


57/669 


25/387 


46/669 


63/387 


91/669 




(23.5%) 


(34.8%) 


(38.7%) 


(35.6%) 


(14.7%) 


(8.5%) 


(6.5%)) 


(6.9%) 


(16.3%)) 


(1 3.6%) 


Northeast 


58/348 


1 68/592 


1 30/348 


217/592 


58/348 


42/592 


37/348 


62/592 


64/348 


99/592 




(1 6.7%) 


(284%) 


(37.3%) 


(36.7%) 


(167%) 


(7.1%) 


(10.6%)) 


(1 0.5%) 


(18.4%)) 


(16.7%) 


Central 


43/218 


1 00/332 


80/218 


110/332 


37/218 


34/332 


19/218 


30/332 


39/218 


56/332 


West 


(1 9.7%) 


(30.1%) 


(36.7%) 


(33.1%) 


(1 7.0%) 


(10.2%) 


(8.7%)) 


(9.0%) 


(17.9%) 


(16.9%) 


North 


36/262 


135/415 


90 7262 


118/415 


28/262 


40/415 


35/262 


56/41 5 


72/262 


65/415 




(1 3.7%) 


(32.5%) 


(34.3%) 


(28.4%) 


(10.7%) 


(9.6%) 


(13.4%) 


(13.5%) 


(27.5%)) 


(1 5.6%) 



Chi square test, patients <50y, p = 0.0001; Chi-square test, patients >50y, p = 0.001. 
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Table 5 Distribution of 5,687 cases compared to overall and female Brazilian population among the five geographic 



regions, and incidence of breast cancer/100,000 women among females 


Geographic Regions 


Total population^ 


Total female population^ 


Incidence of breast cancer/100,000'' 


Our cases 


North 


1 5,864,454 (8%) 


7,859,539 (8.1%) 


21 


678 (12%) 


Northeast 


53,081,950 (27%) 


27,172,904 (27.9%) 


37 


941 (16%) 


Southeast 


80,364410 (42%) 


41,287,763 (42.4%) 


71 


2,461 (43%) 


South 


27,386,891 (14%) 


13,950,480 (143%) 


71 


1,056 (18.6%) 


Central West 


14058,094 (7%) 


7,078,123 (7.2%) 


51 


551 (9%) 


Total 


190,755,799 


97,348,809 


56 


5,687 



''Data from Census 2010 IBGE [15]; "Data from Brazilian National Cancer Institute (INCA) [19]. 



access may contribute to higher rates of late-stage 
presentation and partially explain the increased rate of 
annual death in African Americans. In this study, we 
did not evaluate the stage or prognostic or outcome — 
factors influenced by social-economic status — but only 
the molecular subtype as a variable in geophysical dis- 
tribution. In Brazil, besides the complexity of racial 
composition, the five geographic regions differ from each 
other regarding climate, urbanization, socioeconomic sta- 
tus, and nutritional habits — conditions that potentially 
can interfere in carcinogenesis as well as in the diagnosis 
and outcome of breast cancer. 

In our study, Northern Brazil presented the highest pro- 
portion of TN carcinomas (20.3%). This region, largely 
covered by the Amazon rainforest, presents the largest 
Amerindian influences, both in culture and ethnicity. 
Indians represent 0.4% of the total Brazilian population 
[19], but they make up 1.5% of the Northern region. 
This region also has the highest African influence 
(77.8%), compared to 66.8% in Northeast, 65.9% in 
Central- West, 43.8% in Southeast, and 22.8% in South [15]. 

We must consider, however, the difficulty in defining 
race in Brazil and the need of a genomic classification. 
Pena et al. estimated the proportion of European, African, 
and Amerindian ancestry using a panel of 40 validated 
ancestry-informative insertion-deletion DNA polymor- 
phisms in a population composed by black, white, and 
brown people from the four most populous Brazilian 
geographic regions (North, South, Northeast and South) 
[18]. The results were surprising as the authors were 
unable to demonstrate statistically significant differences 
among the four studied regions [18]. The authors found 
that European ancestry was predominant in all regions, 
varying from 60.6% in the Northeast to 77.7% in the 
South. Besides the difficulty to determine race in Brazil, 
these data show an important difference between the so- 
cial and genomic determination of the African influence, 
particularly among the North population. The large area 
of Brazil has a variety of climate zones: equatorial, tropical, 
semi-arid, highland tropical and subtropical, each one 
with different life style, in part determined by the contri- 
bution of different ethnical groups, part by the substantial 



differences in the industrialization degree and social 
development. 

The South region of Brazil houses the largest percent- 
age of European descents, including German, Italian and 
Polish ancestry. White races comprise 80.6% of the 
population in this region compared to 24% in the North, 
29.5% in the Northeast, 58.5% in the Southeast and 
43.5% in the Central-West. According to our results, the 
South region showed the highest frequency of luminal A 
carcinomas — the more favorable molecular subtype — as 
well as a higher frequency of ER/PR-positive tumors, 
which is consistent with other studies [7,30,31]. Accord- 
ing to the Brazilian National Institute of Cancer (INCA), 
Rio Grande do Sul, a state in the Southern region with a 
great influence of Italians and Germans boasts the highest 
incidence of breast cancer with 71 cases/ 100,000 women 
[19], which is consistent with other studies [14]. Our re- 
sults indicated that most of our breast cancers located in 
this geographic region are a luminal A subtype and simul- 
taneously presented with the lowest rate of HER2, TP, and 
TN tumors, suggesting an overall better prognosis. 

The second region with a high proportion of luminal 
A and ER/PR-positive carcinomas was the Southeast, 
which was also the region with the second largest white 
population. The lowest incidence of luminal A was doc- 
umented in the North, which presented a rate of breast 
cancer varying from 10 cases/100,000 women in the state 
of Acre to 26 cases/ 100,000 women in the state of 
Rondonia [19]. According to INCA, the only state of 
the North with a discrepant higher rate of breast cancer 
is Tocantins, which is located near the Central-West 
region, with 27 cases/100,000 women [19]. 

In addition to luminal A carcinomas, the luminal B cat- 
egory also was more prevalent in the South and Southeast 
and showed a lower frequency in the North region. It is 
important to stress that our luminal B did not include 
HER2-coexpression, like some authors have classified 
[7,11]. We analyzed the TP phenotype separately using 
classification criteria suggested by the previous St. Gallen 
consensus [32]. The TP phenotype was more frequent in 
the Central- West region, while HER2-enriched carcinomas 
were more prevalent in the North. In fact, our results 
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indicated that more aggressive molecular subtypes are 
more prevalent in the North. This information must be 
considered in the planning of breast cancer prevention 
programs. 

The distribution of HER2 carcinomas among different 
races/ethnic groups is more difficult to analyze because 
some authors included TP within the HER2 group [26], 
and others, within luminal B [7,11,13,30]. Fewer studies 
analyzed the group separately as we did. Kurian et al. 
did not find differences in the distribution of HER2- 
carcinomas, but they included TP among their HER2 
tumors [26]. 

The differences in molecular subtypes that we ob- 
served among the five geographic regions persisted in 
the subgroups of patients under and over 50 years, sug- 
gesting the existence of a geographic phenomenon more 
than effect of age itself. Similar results have been reported 
by others [30]. Clarke et al. analyzed the distribution of 
the breast cancer subtypes among 91,908 patients from 
California and observed that black-women had higher 
rates of triple-negative carcinomas at all ages. The North 
region presented the lowest mean of age, and it was statis- 
tically different when compared to the Northeast, South, 
and Southeast. 

Although we found significant differences in the distri- 
bution of molecular subtypes between the five geographic 
regions, we could not associate them to African ancestry, 
as other studies have been able to do in primarily the 
USA, despite the important African influence in the 
Brazilian racial composition. The reasons for this find- 
ing remain to be explored. One possible explanation is 
that the influence of African ancestry in Brazil may be 
different from that seen in the USA and Europe due to 
high rate of interracial marriage seen in Brazil. How- 
ever, as we discussed above, there are other important 
factor to be considered. 

There are studies that explored some genetic condi- 
tions implicated with racial/ethnic differences in breast 
cancer. Wang et al. observed a significantly higher 
methylation of the suppressor gene CDH13 in breast tumor 
samples from African-American women when compared 
with European- American patients. The hypermethylation of 
the CDH13 gene probably is related to early onset ER- 
negative breast cancer [33]. Several single-nucleotide 
polymorphisms (SNPs) have been associated to age and 
race of breast cancer patients [34]. Associations were 
observed for SNPs in FGFR2, LSPl, HI 9, TLR1/TLR6 
and RELN for African-Americans and in FGFR2, TNRC9, 
H19 and MAP3K1 for whites [34]. It is highly probable 
that the racial admixture can be responsible for some 
genomic differences, which are not necessarily similar 
to the ancestry. 

Health disparities, either by racial/ethnic, socioeconomic, 
cultural, climatic, nutritional, or geographic are very 



complex to decode since there is a significant overlap 
between these factors [35]. Understanding the differences 
in breast cancer characteristics between geographic re- 
gions is the first step to organize healthcare programs. 
Moreover, this knowledge has important impact for the 
design and interpretation of clinical trials. As such, we be- 
lieve that our study is relevant to determining the best 
strategy of health care and to better understand the tumor 
biology of breast cancer in large countries like Brazil with 
diverse economic and race composition among different 
geographic regions. 

Conclusions 

The distribution of molecular subtypes of breast cancer 
differs by geographic region in Brazil, a country of 
continental dimension and a wide racial variation of 
people. These differences are multifactorial and must 
be taken into account for public health policy. 
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